On the Efficiency of Estimating Penetrating Rank on Large Graphs
نویسندگان
چکیده
P-Rank (Penetrating Rank) has been suggested as a useful measure of structural similarity that takes account of both incoming and outgoing edges in ubiquitous networks. Existing work often utilizes memoization to compute PRank similarity in an iterative fashion, which requires cubic time in the worst case. Besides, previous methods mainly focus on the deterministic computation of P-Rank, but lack the probabilistic framework that scales well for large graphs. In this paper, we propose two efficient algorithms for computing P-Rank on large graphs. The first observation is that a large body of objects in a real graph usually share similar neighborhood structures. By merging such objects with an explicit low-rank factorization, we devise a deterministic algorithm to compute P-Rank in quadratic time. The second observation is that by converting the iterative form of P-Rank into a matrix power series form, we can leverage the random sampling approach to probabilistically compute P-Rank in linear time with provable accuracy guarantees. The empirical results on both real and synthetic datasets show that our approaches achieve high time efficiency with controlled error and outperform the baseline algorithms by at least one order of magnitude.
منابع مشابه
Evaluation of Cell Penetrating Peptide Delivery System on HPV16E7 Expression in Three Types of Cell Line
Background: The poor permeability of the plasma and nuclear membranes to DNA plasmids are two major barriers for the development of these therapeutic molecules. Therefore, success in gene therapy approaches depends on the development of efficient and safe non-viral delivery systems. Objectives: The aim of this study was to investigate the in vitro delivery of plasmid DNA encoding HPV16 E7 gene...
متن کاملASAP : Towards Accurate, Stable and Accelerative Penetrating-Rank Estimation on Large Graphs
متن کامل
Measuring technological gap ratio of wheat production using StoNED approach to metafrontier
The aim of this paper is to use the concept of the metafrontier function to study the determination of efficiency differentials and Technological Gap Ratio (TGR) on wheat production in Khorasan Razavi province. In this study, we used the metafrontier function and group frontier based on the concept of Stochastic Nonparametric Envelopment of Data analysis (StoNED). The data used in this stud...
متن کاملEstimating Capacity Utilization in Iranian Pharmaceutical Industry: Pharmaceutical Companies on the Stock Exchange, 2008-2012
This study aims to measure the production capacity and capacity utilization in pharmaceutical industry. The capacity utilization is the ratio of actual production level to the potential production level which shows the gap between real production and production capacity. Method: Through econometric methods, the short-run translog cost function is estimated with the cost share function of produ...
متن کاملEstimating Capacity Utilization in Iranian Pharmaceutical Industry: Pharmaceutical Companies on the Stock Exchange, 2008-2012
This study aims to measure the production capacity and capacity utilization in pharmaceutical industry. The capacity utilization is the ratio of actual production level to the potential production level which shows the gap between real production and production capacity. Method: Through econometric methods, the short-run translog cost function is estimated with the cost share function of produ...
متن کامل